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We present a permutation-invariant distance between atomic configurations, defined through a functional 
representation of atomic positions. This distance enables to directly compare different atomic environments 
with an arbitrary number of particles, without going through a space of reduced dimensionality {i.e. fin¬ 
gerprints) as an intermediate step. Moreover, this distance is naturally invariant through permutations of 
atoms, avoiding the time consuming associated minimization required by other common criteria (like the Root 
Mean Square Distance). Finally, the invariance through global rotations is accounted for by a minimization 
procedure in the space of rotations solved by Monte Carlo simulated annealing. A formal framework is also 
introduced, showing that the distance we propose verihes the property of a metric on the space of atomic 
configurations. Two examples of applications are proposed. The hrst one consists in evaluating faithfulness 
of some hngerprints (or descriptors), i.e. their capacity to represent the structural information of a config¬ 
uration. The second application concerns structural analysis, where our distance proves to be efficient in 
discriminating different local structures and even classifying their degree of similarity. 


I. INTRODUCTION 

Since one decade, the comparison of two atomic struc¬ 
tures has raised a growing interest in the fields of biol¬ 
ogy, physics and chemistry. The term « atomic structure » 
embodies a wide variety of situations (molecules, crys¬ 
tal, fluids, etc) and refers in a general sense to a set of 
atoms. For a molecule, it corresponds to the positions of 
its constituting atoms. For a condensed matter system, 
it may be the positions of a given atom and its neigh¬ 
bors. The methodologies developed to compare two sets 
of atoms draw their diversity from the variety of consid¬ 
ered applications, for example comparing molecules^"— 
or crystal structuresii^^— , performing the Minima Hop¬ 
ping method^i^ and more recently Machine Learning ap¬ 
proaches used for numerical potentials and forcesi^— . 

In all these fields, efforts have been made to give a 
permutation, rotation and translation invariant measure 
of similarity (or distance) between atomic configurations. 
Indeed, it is of paramount importance to provide a com¬ 
parison of structures having these invariances, since the 
studied properties do not depend on the ordering of 
atoms and the orientation of axes. In biology, rotations 
are defined along axes passing through the center of mass 
of the molecule, whereas for numerical potentials it is 
around the atom for which we aim at calculating the en¬ 
ergy or forces. For example, if a particle has an energy 
that depends on all its neighbors, applying a global ro¬ 
tation to the system does not change this energy. In the 
same way, the local structure of a crystal (such as Face 
Center Cubic, Cubic Center, etc) does not depend on the 
choice of axes or the indexing of neighbors. 

In the field of Machine Learning and crystal recogni¬ 
tion, several approaches rely on the use of functions often 
called fingerprints or sometimes descriptors, which rep¬ 
resent the structural information of an atomic conhgu- 
ration. A comparison between structures then reduces 


to the comparison of their associated hngerprints. A 
possible choice of hngerprints is based on the eigenval¬ 
ues of matrices depending on inter-atomic distances be¬ 
tween the atoms in the system^. In this framework, sev¬ 
eral matrices depending on distances between neighbor¬ 
ing atoms can be used, such as Weyl matricesi^. Coulomb 
matrice a^d^ or Kohn-Sham Hamiltonian matrices, Hes¬ 
sian matrices or overlap matrices^. Other representations 
can be used, such as the symmetry functions of Behler 
and Parrinelloiiii^iii, bond order parametersi^, power 
spectrum, bi-spectrum and 4D-bi-spectru m^^d^ , or even 
bit-strings^. Nevertheless, beyond the fact that these 
representations are often arbitrary, they are generally in¬ 
trinsically incomplete. Indeed, if a system is constituted 
of n atoms, it has 3n degrees of freedom. All the methods 
relying on eigenvalues of matrices^i^d^ compare at most n 
eigenvalues, and cannot entirely represent the systemic. 
Therefore, it is in general delicate to show that calcu¬ 
lating the distance between hngerprints corresponds to a 
genuine distance in a mathematical sense. 

Another method used to dehne a distance between 
environments is the Rmsd {Root Mean Square Dis- 
tance)r^. This distance is simply the square root of 
the sum of square distances between the atoms in each 
environment. It is indeed a distance from a mathemat¬ 
ical viewpoint, but suffers from two main drawbacks. 
First, the two conhgurations must have the same number 
of particles; this restriction is acceptable when compar¬ 
ing molecules but it is unacceptably restrictive in the 
context of structural analysis of condensed matter sys¬ 
tems. Moreover, this distance is not permutation invari¬ 
ant, which means that all permutations should be tried in 
order to compare two conhgurations, and for each permu¬ 
tation the optimal rotation should be calculated. This is 
feasible with some advanced Monte Carlo approach^, but 
the calculations cannot be carried out for large systems. 

The limitations induced by these methodologies call 
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for new efforts to define a distance that can be applied to 
large systems with an arbitrary number of atoms, for 
subsequent use in Machine Leaning method o^^'^'^ . In 
this context, a direct measure of similarity has been in¬ 
troduced by Bartobii, the SOAP {Smooth Overlap of 
Atomic Positions). It does not depend on any finger¬ 
print and does not require to explore the set of permu¬ 
tations. In this article, we generalize the notion of rep¬ 
resenting a configuration by a probability density, and 
define new permutation-free distances between configu¬ 
rations by appropriately generalizing Soap and Rmsd. 
We introduce in Section m a formalism from which we 
derive a distance between configurations. The first task 
is to give a mathematical definition of what we will call 
environment and configuration in Section III Al We dis¬ 
cuss in Section fllBI the Functional Representation of an 
Atomic Configuration (Frac), from which we derive a 
distance on the space of configurations in Section m 
first for single chemical element systems, and then for 
multi-elements ones. Several extensions of the methods 
are also proposed in Section IIII Cl We next provide in 
Section 113 some examples of applications. The first one 
is the evaluation of the faithfulness of fingerprints to rep¬ 
resent the structure of a chemical atomic environment. 
The second application aims at proving the efficiency of 
this distance in the context of structural analysis. 


II. FRAMEWORK 

A. Atomic Environment and Configuration 

We first give a mathematical framework to what is 
usually called an atomic configuration or environment, 
in order to define a distance on such a space. Indeed, 
a configuration is usually considered as a simple set of 
vectors (gi)r=ii 9® ^ in the Rmsd framework 

for example^i. Nevertheless, this representation is not 
adapted when dealing with condensed matter systems, 
since the number of neighbors may vary. In the follow¬ 
ing, S is the space of permutations and TZ the space of 
rotations (we use the same notation for rotations of 
and rotations of a rigid body), i.e. matrices R such that 
R = RR^ = I and deti? = 1. For now, we restrict 
ourselves to systems constituted of a single chemical ele¬ 
ment, an extension to multi-elements systems being pro¬ 
vided in Section [Tll PI 

First, an environment constituted of n atoms is a set 
of n vectors of R^. We therefore define 

C" = {(%)”=! I V^, fteR^}. 

However, we want to consider environments with an ar¬ 
bitrary number of atoms. One possible appropriate defi¬ 
nition of an atomic environment is then 

C=\Jr. (1) 

n>l 


The positions are defined with respect to the center of 
frame which depends on the application at hand. This 
allows to get rid of spurious translation invariances. 

In order to describe an environment with particles in 
a radius i?cut around the origin, we restrict the set of 
admissible environments to environments having all their 
elements within a distance Rent of the origin: 

Ccut=UC-t, (2) 

n>l 

with 

Cc”ut = {fe)r=i I V*, fteR^ llftll <^cut}. 

We will see that only small adaptions are required to 
use Ccut rather than C. We therefore work in C with no 
restriction, and make precise the adaptions required to 
work with Ccut when necessary. 

Several atomic properties like the potential energy in 
classical simulations, or the local structure, are defined 
with respect to the environment (i.e. the set of surround¬ 
ing particles) of a given atom. As these properties do not 
depend on global translations, rotations or permutations 
(or equivalently on an arbitrary choice of axes and or¬ 
dering of particles), an appropriate definition of an envi¬ 
ronment should retain these invariances. For an environ¬ 
ment C = (9i)r=i G C, a permutation cr G 5, a rotation 
R € TZ, we define crC = {qa(i))'i=i and RC = (RgOfci- 
Two environments Ci and C 2 are considered equivalent 
if one is a rotation and/or a permutation of the other. 
This suggests the following equivalence relationship: for 

Cl = {qi)Zi eC,C 2 = (gO/di G C, 

ni = 712 and 3 (tG5, 3R G TZ 
Cl C 2 _ aRC2. 

To define what we call a configuration in the following, we 
gather in classes all the environments that are equivalent. 
As a result, the space of configurations reads: 

C=C/^. (3) 

This means that each element C G C is the ensemble of all 
possible rotations and permutations of an environment. 
From now on, we will call environment an element of 
C and configuration an element of C. Once again, two 
environments Oi and C 2 that differ by a rotation or a 
permutation belong to the same class (configuration) C, 
and are therefore understood as identical inj;he^ense of 
configurations. In practice, a configuration C G C can be 
represented by any of its elements C G C (an example of 
two environments representing the same configuration is 
displayed in Figure[T]). This definition is motivated by the 
fact that in many applications, for example calculating 
potentials or comparing molecules, only the structure of 
the environment matters, and not its orientation with 
respect to arbitrary axes or the ordering of its elements. 
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We now define a functional representation of an en¬ 
vironment. This representation turns out to be permu¬ 
tation invariant, and is a fundamental tool to develop in 
Section|TTT]a distance on C, or equivalently a permutation 
and rotation invariant distance on C. 



X 


Figure 1: Example of two environments Ci and C 2 
representing the same configuration C in two 
dimensions. 


B. Functional Representation of Atomic 
Configuration (Prac) 

In this part we still consider systems with a single 
chemical element. Our goal is to represent a set of par¬ 
ticles by a smooth probability density that can be ma¬ 
nipulated more conveniently than a set of vectors. The 
idea is that a set of n particles at positions (gOfci 
be exactly represented by a set of Dirac functions 5q^ at 
their positions: 
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P{9l,....9n} = (4) 

^ i=l 

This expression is convenient since it is invariant with 
respect to permutation of atoms: we exploit this prop¬ 
erty in Section imi But this representation is unusable 
in practice, because the spectral expansion of a Dirac 
delta function is too slowly convergent. Approximating 
the Dirac delta function requires basis sets of very large 
dimension. It is more convenient to smooth out and 
represent an environment by a density Pm where cr is a 
smoothness parameter, as in Refs. [13 and fTol. This den¬ 
sity Pa can be interpreted as a density of mass, or as a 
probability density of presence, the particles’ positions 
being considered as realizations of a random variable in 
Asa result, we need to define appropriate functions p 
to represent the presence of one atom at position 0 (other 
locations are obtained by translations of such functions). 
These functions should verify the following properties: 

• regularity, e.g. p is continuous by parts; 



(a) Atomic environment with 
central atom (blue) and 
neighbors (red) 


(b) Associated functional 
representation 


Figure 2: Example of atomic environment for a fluid 
with associated Frag 


• (^ is bounded with unit mass, i.e. 0 < < 1 and 

• is positive and reaches its maximal value at g = 0; 


• v{<l) 


> 0 . 


O'—>-+oo 


Note that we could restrict ourselves to more regular 
functions, for example infinitely differentiable ones. Sev¬ 
eral choices can be made, as made precise in section ITlI Cl 
but for the sake of clarity we consider in the sequel the 
Gaussian case. The choicei^ 

= (27rcr2)-i exp (5) 


verifies the required properties to represent the presence 
of an atom. A configuration C = (gi)r=i ^ ^ ^e 

represented by the density 

- n 1 

n ^ n 

i—l 2=1 

As an example, the functional representation of an 
atomic environment of a fluid with such a function is 
shown in Figure [2 The asymptotic behavior of this 
density shows that the functional representation ([51) is 
a smoothed representation of atomic positions. Indeed, 
the Gaussian function tends to a Dirac delta function 
when cr tends to 0. As a result 

^ Ti 1 ^ 

Per ^ ^ P{qi ,...,qn} ^ ^ ^qi : 

n ^' cr-rO LI ’ J ^ Z—/ 

i=l i=l 

in the sense of distributions. More details about this 
functional representation are provided in section IIII Cl 
In some applications, it is convenient to consider only 
atoms within a cut-off radius of the origin. We introduce 
to this end a decreasing cut-off function /cut : K i—>■ R 
such that 


/cut(O) = 1, fcut{r) = 0 for r > i?cut- 
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For instance, Parrinello and Behler— choose 


C 2 




GC: 


fcut{r) 



r* ^ -^cut, 
^ ^ Rent- 


( 7 ) 


Of course other options may be considered, such as 
feutir) = 1{ i'<-Rcut}(''’)■ framework, an environ¬ 

ment is now represented by its smoothed weighted den¬ 
sity: 


Ph.A<i) = ^ -■ (8) 

i=l 

The denominator in ([5]) is introduced in order for 
to be a probability density {i.e. with unit mass). From 
a physical viewpoint, it shows that we do not attach 
the same weight to each particle: atoms far away from 
the center have a smaller weight since they should con¬ 
tribute less to the description of the environment. It 
represents an effective number of particles in the neigh¬ 
borhood of the considered atom. All the following results 
are presented with the representation ([S]) but can be eas¬ 
ily adapted with (0. 

Let us emphasize some advantages of the Frac. First, 
it is invariant with respect to permutations of atoms of 
an environment. Secondly, it provides the same type of 
representation (a function on whatever the number 
of atoms, which allows to compare configurations with 
different numbers of atoms. Finally, a distance between 
configurations can be naturally defined in this functional 
framework. 


d2 {Ci,C2f 


1 
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nin2 


ni n2 p 

EE / ^qM)^q'M^^ 


i=l j=l 
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In order to construct a rotation invariant distance, we 
take the infimum over rotations of one representing en¬ 
vironment, as done for the Rmsd ^^i^^ . For two configu¬ 
rations Cl, (72 £ C, we choose two representing environ¬ 
ments Cl £ Cl and C 2 £ C 2 belonging to each of these 
configurations and define: 



= inf d 2 {Ci,RC 2 ) 
Ren 

= inf d 2 {RCi,C 2 ), 


( 12 ) 


which is a distance on C (see Appendix lAl). In practice, 
we compare one environment Ci to another C 2 with CH) 
and estimate the rotation minimizing this quantity, as in 
a Rmsd frameworb^i. Nevertheless, there seems to be no 
simple expression of this optimal rotation, contrarily to 
the Rmsd cas o^^i^^ for which the optimal rotation is ob¬ 
tained as solution of a singular value decomposition. We 
describe a procedure to numerically estimate the optimal 
rotation in section UlI B1 


III. ATOMIC CONFIGURATION DISTANCE 
(ACD) 


A. Single Element Systems 

Once a Frac has been defined for systems with one 
chemical element, a natural way to derive a distance be¬ 
tween configurations is to define a distance between their 
associated densities. A classical choice is the distance 
since it relies on a standard scalar product. Indeed, defin¬ 
ing as in Ref. the overlap integral 


S{pi,p 2 )= P 1 P 2 , ( 9 ) 

the distance reads 

\\Pl - P2\\l2 = S{pi,pi) - 2S{pi,p2) + S{p2,P2)- ( 10 ) 

By identification between an environment and its density, 
we define the following distance, for Ci = and 


The Smooth Overlap of Atomic Positions (Soap) in¬ 
troduced in Ref. also consists in taking a Gaussian 
function for as in ([5]) but averaging over rotations by 
decomposing over spherical harmonics. Here, we also 
choose Gaussians, but rather consider the infimum over 
rotations. The idea is drawn from the Rmsd, and enables 
to define a permutation and rotation invariant distance 
on C, or equivalently a distance on C. Moreover, in the 
Gaussian case, the overlap integral ([9|) has an analyt¬ 
ical expression which alleviates the need for numerical 
quadrature. Indeed, the following formula holds (see Ap¬ 
pendix |B]): 


S{pi,P2) 



nin2 


ni 712 

EE®^p 

i=l j=l 




( 13 ) 


with K = 


8(7rcr^) 2 . 


The permutation invariant measure 
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of distance based on (El) then simplifies as 


d2{Ci,C2f 



ni 

H exp 


4a2 J 


2k-^ 

nin2 


ni 712 

i=i j=i 




(14) 




+ 



n2 

i,j=l 


4 ct 2 j ■ 


A permutation and rotation invariant distance is finally 
obtained with (1121) . This is the formula we use in the ap¬ 
plications presented in this work, with the cut-off func¬ 
tion as given in Section lll B1 Moreover, asymptotic results 
when tr —)> 0 and a -l-oo are studied in Appendix [C] 
Finally, as noticed in Ref. M the quantity 


S{Ci,C2) 


1 

nin2 


ni 712 

EE exp 

i=l j=l 




is a permutation invariant measure of similarity between 
atomic environments. Setting 


S{Ci,C2) 


snp S{Ci, RC 2 ) 

ReTZ _ 

^S{Ci,Ci)S{C2,C2y 


we define a kernel on the space of configurations C that 
can be directly used for Machine Learning approaches, 
as SOApi^. The supremum ensures that the optimal ro¬ 
tation maximizes the similarity of the structures. 


B. Optimal Rotation 

From a practical viewpoint, we search the minimum 
in m by parameterizing a rotation R in terms of the 
Euler angles2ii2^ (a,/3,7) , but other choices could be 
made, such as quaternions22ii^. The three angles corre¬ 
spond to three rotations around different axes. We use 
rotations of angle v over axes x and z, defined by the 
following rotation matrices: 

/I 0 0 \ 

Rx{v) = 0 cos(i^) -sm(i/) , 

yO sin(i^) cos(z/) j 


(a) Environment Ci 


(b) Environment C2 


Figure 3: Example of two environments Ci and C 2 
representing the same configuration C in three 
dimensions. 


dimensional compact space. We use a Monte Carlo pro¬ 
cedure (simulated annealing2Si21, see Appendix |D|) to 
search for the minimum. 

In order to test the method, we fix two environments 
Cl, C 2 with 4 particles in each one and Ci = (JRC 2 for 
some rotation R and permutation a (see Figure ED , so 
that Cl and C 2 represent the same configuration. Then 
we study the mapping J : >->■ d 2 {Ci, Ra,p^-yC 2 )- 

In Figured! we display isosurfaces of J. We see that the 
minimum of this function is indeed 0, and that several 
rotations meet this minimum (given that the system has 
several symmetries). Moreover, a realization of simulated 
annealing is plotted, showing that the global minimum 
is found by a numerical method. As a second example, 
two environments Ci, C 2 of a Lennard-Jones fluid (see 
Appendix |F]) are considered. An example of such an en¬ 
vironment is represented in Figure Ej (a). In Figured! we 
plot isosurfaces of J with a realization of simulated an¬ 
nealing for these environments. Here, two local minima 
are found, but only one is the global minimum. 

We studied more specifically the case of the Gaussian 
Frac with L2 distance, since the analytical formula (flTl) 
allows to dramatically reduce the computational cost of 
its evaluation. In this case, the parameter cr, if not es¬ 
sential from a theoretical viewpoint, plays a role in nu¬ 
merical applications. Indeed, changing a does not change 
the position of the global minimum, but affects the gra¬ 
dient of function J. A large value of a leads to a shallow 
function, whereas a small cr leads to a peaked function J. 
Therefore, the numerical efficiency of the optimization 
procedure depends on the value of this parameter. 


C. Generalizations 


( cos(i^) — sin(z/) 0\ 
sin(i/) cos(j^) 0 . 

0 0 ij 

A rigid body rotation^iiS^ is then defined by Ra,/ 3 ,-y = 
Rz{l)Rx{P)Rz{.oi) with {a,l3,'y) € [0, 27r] x [0, tt] x [0, 27r]. 
Therefore, the optimization is performed in this 3- 


We considered in the previous section the particular 
case of Gaussian functions tp for representing atoms. Let 
us however emphasize that other choices are possible and 
in fact many other generalizations of Frac can be pro¬ 
posed. 

First, several choices are available for representing the 
presence of one atom, for example in dimension 1: 
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Distance 

0,036 

Po,03 

; 

: 0.02 

lO.Ol 

0 


Figure 4: Isovalues of the mapping 
J : {a,P,^) !->■ d 2 (Ci, Ra,/3^jC2) when Ci and C 2 belong 
to the same class. The points represent the iterates of a 
simulated annealing procedure. 


I g + cr, for g G (-cr,0), 

• V’^iq) = I -g + cr, for gG (0,0-), 

[ 0, otherwise. 

The idea always is to put weight in the vicinity of the 
central atom, with more or less smoothness and accu¬ 
racy. More formally, this reduces to using an approx¬ 
imation of a Dirac function in the sense of distribu¬ 
tions. The indicator function in dimension 3, = 

(|7r(7^)“^l{||q||<o.}(9)i represents the atom as a volume 
in space. Other functions represent a mass density van¬ 
ishing more or less smoothly. Another interpretation 
is probabilistic. We can consider that the positions of 
neighbors are realizations of a random variable drawn ac¬ 
cording to a density y. In this case, the density (O with 
the choice <^'^( 9 ) = corresponds to 

a histogram in approximating this measure x- The 
Gaussian choice corresponds to the Gauss kernel approx¬ 
imation of X, which is a particular case of non-parametric 
estimation of a probability density^i^S, and can be inter¬ 
preted as a smoothed histogram. 

From a formal point of view, the measure defined 
in ([H]) corresponds to the convolution of the measure 
P{qi,- -,qn} defined in ([4]) with the shape function tp®', 
which reads: 



Distance 

0.005 

|o.0048 
Jo, 0044 
jo,004 
|0,0036 
jo,0032 
0,0029 


Figure 5: A realistic example, two configurations of a 
fluid. The wire frame represents level sets of the 
mapping J : (a,/3,7) 1 -^ d 2 {Ci, Ra,p,-yC 2 ) and the points 
a simulated annealing procedure. 





Paiq) = ^^{\\q\\<a}iq) 


P<r{q) 

= / P{qi,...,q„}iq')p'"iq-q'W- 

As a result, the function can be interpreted either as 
a way to represent an atom or as a shape function in a 
regularizing convolution of the measure P{qi,...,q„}- 

Another possible extension of the method is to con¬ 
sider different distances between densities, such as LP{w) 
distances where p G N* and w is typically a rotation in¬ 
variant non-negative weight function with unit mass. In 
this case, 


||pi - P2\\lp{w) = \pi - P2\^v?j . (15) 

The weight w can be used for example to give a higher 
importance to some region of space. In our case, the 
choice p = 2 is reasonable not only because it is coherent 
with the distances generally used (for example between 
sets of vectors for the Rmsd), but also because it relies 
on a scalar product that proves to be convenient for cal¬ 
culations. On the other hand, no analytical expressions 
are in general available for the extensions listed so far. 
They are, as a result, more computationally expensive in 
practice. 

It is nonetheless possible to consider another way to 
generalize Acd, by directly starting from (fOl) and re¬ 
placing the Gaussian function by any shape function 
verifying the properties outlined in Section IIIBI (typi¬ 
cally one presented at the begining of this section). This 
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amounts to considering 


S{C^,C2) 


1 

nin2 


ni n,2 




The function now plays the role of a correlation be¬ 
tween two particles. If the particles are close with respect 
to the bandwidth cr, their distance is small and is typ¬ 
ically close to 1. If they are distant, is close to 0. This 
opens a wide range of perspectives for constructing dis¬ 
tances for which the same conclusions as in Section m 
can be drawn. 


D. Multi-elements Systems 

We now extend the previous results to systems of p 
chemical elements, which offer a wider range of applica¬ 
tions. In our framework, a natural way to represent an 
environment of p chemical species is to represent it as a 
product of environments of individual species: 

CP = Cx...xC. (16) 

p times 

The only issue is to correctly describe admissible rota¬ 
tions and permutations of such a system. For a multi¬ 
element environment C = (C^, C^,..., C^) G C^, with 
Hi elements for each C*, we define a permutation of the 
system a = (ai, a2, ■ ■ ■, ap) G x ... x as 

aC= (aiC\a2C^,...,apCP), 


where d 2 is defined (fTH) in the Gaussian case. This mea¬ 
sure of distance is permutation invariant thanks to the 
Frac framework. Now, to make this distance rotation 
invariant, we again minimize over rotations. For two con¬ 
figurations Cl, C 2 G CP, represented by two environments 
Cl G Cl and C 2 G C 2 , we set 




d2iClRC^^)C 


(17) 


Note that since rotations act in the same way on all sin¬ 
gle environment {i.e. for each chemical element), the 
minimum is taken over the sum of distances with the 
same rotation for each species. If one wants to give var¬ 
ious importance to the different elements, a family of 
distances may be introduced with a vector of positive 
weights w G (R+)*’ with by setting 



inf 


\ 


P 

w,d2{Cl,RC^2)^. 


We can check that d^ verifies all the properties of a dis¬ 
tance on CP as in the single element case. Of course, 
an extension to configurations in a cut-off radius Ccut is 
straightforward following the discussion of Section IIIBI 


IV. APPLICATIONS 
A. Faithfulness of fingerprints 


and a rotation RgTZ as 

RC= {RC\RC'^,...,RCP) . 

First, this means that permutations are allowed only 
between elements of the same chemical element (and 
not with other species). Secondly, rotations are de¬ 
fined on the whole system, so they modify each en¬ 
vironment in the same way. It is then straightfor¬ 
ward to define the equivalence relationship ~ as in Sec¬ 
tion EAI For Cl = i{q})Zi,---A<lf)Zi) e CG C 2 = 

\/ i < p, rii = rrii and 3a G Sn, ^ R G TZ 

Ol ~ 09 

such that Cl = aRC 2 . 

We next define Cp = CP^^. In other words, two environ¬ 
ments belong to the same class if and only if one can be 
written as a rotation of the other up to a permutation of 
atoms in each species. Now, we can introduce a distance 
between two multi-elements environments as the sum of 
distances for each single element, that is: 

dP(Ci,C2)2 = d2{,clclf + . . . + d2(Cf, CP)\ 


Fingerprints appear in several areas of chem¬ 
ical physics, for example for representing Po¬ 
tential Energy Surfaces (PES) with Machines 
Learning approache o^^d^iiSiiTpg local structure 

recognition^ddid^. The problem with these methods 
is that two environments that differ by a rotation or a 
permutation are understood as different inputs for the 
model, and may lead to different outputs, whereas the 
energy and the forces remain unchanged. Moreover, 
the dimension of the input vector has to be constant, 
and should not depend on the number of neighbors. 
Therefore, fingerprints (or descriptors) generally consist 
of functions of neighboring atoms, invariant by rotation 
and permutation. They are supposed to characterize 
the structure of an atomic environment. Formally, 
they can be defined as a mapping <I> : C 1 —>■ R™ such 
that for any permutation a G S and rotation R G TZ, 
it holds ^{aRC) = d)(C). We already mentioned 
in the introduction some appropriate descriptors for 
such applications, especially the ones based on eigen¬ 
values of matrices^ii^d^, bond-order parameters and 
bi-spectru m^^d^ and symmetry function o^^d^d'^ . 

As an example, we chose to study the faithfulness of 
representation of two types of symmetry function o^^d^d*^ , 
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is the angle associated with the triplet made of the 
central particle and its neighbors j, k. Moreover, /cut 
is chosen as in Q and Rg, A and / are real parameters. 


but all the cited descriptors could be tested. We in¬ 
vestigate the relevance of two-body functions and 
three-body functions (see cinci (jl9p below). 

The sums run over the neighboring atoms and 9jk 


= (18) 
i=l 


G^ = 2 ^-^ (1 

j,k=l 


Acos(0,fe))fe-''(ll^^-ll'+ll«'=ll'+ll«^-^'“ll')/cut(lk,||)/cut(||gfe||)/cut(||g,^ 


(19) 


The coordinates of $ in K™ are then obtained by cal¬ 
culating the functions G^ and G^ for various values of the 
parameters ry, Rg, A, (. One can easily check that d) is 
invariant under rotation and permutation. Our goal is to 
study the relevance of this choice of fingerprint <&: Is this 
representation able to discriminate different structures ? 
What are the optimal parameters? How many functions 
should be used? 

Our strategy is to estimate for various couples of con¬ 
figurations Gi , Gj the distance in the space of descriptors 
||$(C'i) — $((7^)11, and the distance in the Cartesian space 
d 2 {Gi,Cj), before computing their correlation. We will 
then be able to understand whether the representation 
is ambiguous or not: typically, a wrong representation 
would lead to high Atomic Configuration Distance d 2 
(Acd) and low fingerprint difference, or conversely low 
Acd and high fingerprint distance. 



Descriptor Distance 

Figure 7: Correlation between Acd and descriptor 
distance for a new set of fingerprints, tr = 2 



Figure 6: Correlation between Acd and descriptor 
distance for two fingerprints, a = 2 


We generate a set of configurations Ci of a Lennard- 
Jones fluid with a molecular dynamics simulation (see 
Appendix | f 1) . As a first example, we apply the procedure 
by taking only one G^ and one G^ functions to represent 
the environment, with i?cut = 8.52 A, 77 = 0.5, Rg = 0 
A, A = —1 and C = 1- Figure [HI we observe the ex¬ 
pected correlation between the descriptor distance and 
the Acd for this choice of descriptors but with an impor¬ 
tant dispersion. After a manual trial and error search, 


we found a set of functions giving a better correlation 
(see Appendix lEl) . as displayed in Figure [T] To quan¬ 
tify the increase of faithfulness of the fingerprints, and 
given that the correlation seems linear, we perform the 
least square fit of the Acd as a function of the descriptor 
distance, and compute for each case the associated cor¬ 
relation coefficient r € [0,1]. The first set of fingerprints 
has a correlation r = 0.55 and the second r = 0.84. 

Another point is to estimate the necessary and suffi¬ 
cient number of functions to describe the configurations. 
Figure [5] shows the evolution of the correlation coeffi¬ 
cients r when adding G^ functions. We fix i?cut = 8.52 A, 
i?s = 0 A, and consider additional values of ry obtained 
as ryi = 1, 772 = 773 = ...until 7713 = ^.. Then we 

choose the same values of 77 for Rg = 2 . A. This makes a 
total of 26 functions. We add the descriptors one by one, 
perform the least square fit for each corresponding finger¬ 
print and study the evolution of the correlation coefficient 
r. We first observe an increasing correlation as expected, 
but after a critical number of functions, r decreases or re¬ 
mains stable. We also note an increase of the correlation 
when adding the first symmetry functions with Rg = 2 A. 
Other functions should be added, such as G^ functions, to 
increase the representation capacity of the set of descrip¬ 
tors. The decrease of correlation may seem surprising at 
first. It is due to the fact that lower values of 77 tend to 




generate functions describing the environment far from 
the central particle. On the other hand, the Acd has 
been implemented with a cut-off function as described 
in section Bl Bl As a result, the description of the envi¬ 
ronment near the cut-off radius i?cut is less correlated to 
the Acd, hence the decreased correlation. This point is 
important in practice for several applications. Typically 
for numerical potentials, using symmetry functions de¬ 
scribing the environment near the cut-off radius is not 
relevant since nearer neighbors have a stronger influence 
on the potential. 



Figure 8: Evolution of the correlation coefficient r when 
adding functions 

Let us now discuss how this procedure can be used to 
optimize the set of parameters used for the descriptors. 
As a simple application, we aim at optimizing the param¬ 
eter r] in (TTSll for only one function. We fix = 0 A, 
.Rcut = 8.52 Aand study the correlation coefficient r as a 
function of rj in Figure [HI The graph shows a maximal 
value around rj* = 0.0381, which corresponds to a Gaus¬ 
sian of standard deviation 3.6. It can be understood as a 
typical length to describe the system around its central 
atom. The correlation for low values of a is still correct, 
but it drastically decreases when rj becomes too large. 
This can be explained by the fact that the variance of 
the Gaussian tends to zero when rj becomes large, so the 
G^ function takes values close to 0 whatever the environ¬ 
ment, and does not describe the environment anymore. 
The correlation graph associated with the optimal value 
rj* is displayed in Figure (TUI Even though the correlation 
is good, we observe as in EigurejHlthat for some couples 
of configurations, the descriptor distance is 0 whereas the 
Acd is not, which is a crucial problem. It suggests that 
our descriptor should also maximize the quantity 

min ||$(Q)-<h(Q)||. 

Several tracks can therefore be followed when Gi and Gj 
do not belong to the same configuration to optimize a 
fingerprint. First, a criterium to optimize should be cho¬ 
sen (maximizing the correlation or minimizing the lack of 
injectivity of $). As a second step, the set of symmetry 


functions can be optimized either by a greedy technique 
(adding the functions one by one with associated opti¬ 
mized parameters) or by defining a set of functions and 
optimizing all parameters together. The first option does 
not provide the optimal set of functions, but is easier to 
implement. The second one possibly requires to explore a 
large set of parameters, and more advanced optimization 
methods should be investigated. 



Figure 9: Evolution of correlation coefficient r with rj 



Eigure 10: Correlation between Acd and descriptor 
distance for one optimized G^ function 


To summarize, the Acd is a way to quantify the con¬ 
servation of information when using a descriptor. It can 
be used to indicate if functions are relevant to represent 
a given set of structures. It also allows to quantify the 
improvement in the representation obtained by adding a 
new describing function, and to optimize the parameters 
in the descriptors. This is particularly important in the 
context of Machine Learning, since the dimension of the 
fingerprint m {i.e. the number of functions in the rep¬ 
resentation) is the dimension of the interpolation space 
for the methods (Neural Networks or Kernel methods). 
Reducing the size of the input space R"* is of crucial im¬ 
portance to increase the efficiency of this interpolation 
(or learning) procedure. Lower values of m also allow 
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to decrease the computational cost of evaluating the nu¬ 
merical potential, which is signihcant in order to perform 
faster Molecular Dynamics simulations. 


B. Structural Analysis 

The goal of this section is to demonstrate in a sim¬ 
ple case that our method can be used to identify the 
local structure of a material. We restrict ourselves to 
the structural analysis of a crystal with one chemical el¬ 
ement, but extensions to crystals with several elements 
or even molecules are straightforward. For the testing 
procedure, we gather a database of crystalline structures 
Ci at 0 Kelvin (at a given density, see Appendix [F] for 
more details on the generation of this database): 

• Body Centered Cubic (BCC), 

• Face Centered Cubic (FCC), 

• Simple Cubic (CS), 

• Hexagonal Close Packed (HCP) 


over the distance to FCC. All distances are larger than 
the distance to FCC, but with an ordering. Indeed, the 
distance to HCP and BCC is higher than the distance to 
FCC typically by a factor 1.5 or 2. On the other hand, 
distances to the other elements up to 4 times higher than 
the distance to FCC. This can be explained by the chem¬ 
ical similarity of HCP and BCC with the FCC structure. 
The chosen descriptor fails to provide this detailed study, 
since its distance to HCP is the highest one. 


o 

o 
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0.02 

- 1 -- 1 - 1 


Sn Diamond 

0.018 

CS 

0.016 

• • 
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=cc 



0.004 
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Descriptor Distance 
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• diamond, 

• liquid, 

• Sn/3 (/3 structure of tin). 

We next compare a structure C of FCC at 100 Kelvins ob¬ 
tained by molecular dynamics simulation to each element 
of the database Ci, by computing the distance in the fin¬ 
gerprints space ||'i>(C') — $((7^)11 and Acd d 2 {C,Ci). We 
use the fingerprint described in Appendix IeI Figure [TT] 
represents (P{C, Ci) as a function of ||4)(C) — 4>(C'i)|| for 
each reference structure Ci. For both Acd and finger¬ 
prints, the structure at 100 Kelvin is closer to FCC than 
to any other structure. This shows that Acd is efficient 
in recognizing the local structure of a material, and it also 
validates the choice of fingerprint used for the recognition 
of these environments. In order to better understand the 
influence of the smearing parameter a, we display in Fig¬ 
ure the evolution of the Atomic Configuration Dis¬ 
tance as a function of cr. The numerical values vary but 
the conclusion remains unchanged: the distance to the 
FCC structure is always the lowest. When a decreases, 
the accuracy of the distance increases, so small variations 
of the structure due to thermal fluctuations tend to be 
understood as a structural difference. On the other hand, 
when a is large, the precision is very low (or equivalently, 
the representing densities are very smooth), and all the 
configurations tend to be recognized as equivalent. The 
asymptotic behavior when a tends to 0 is also displayed; 
the distance scales as as predicted by the analy¬ 

sis performed in Appendix [C] The same behavior is ob¬ 
served for larger values of a. Lastly, this distance induces 
a classification of similarity between structures. Indeed, 
Figure [T51 shows the distance to each reference structure 


Figure 11: Distance from structure C to reference 
structures, a = 1 



O 


Figure 12: Atomic Configuration Distance to each 
reference structure as a function of a 


V. CONCLUSION 

Our goal in this work was to compare atomic config¬ 
urations for various purposes, such as structure recog¬ 
nition or Machine Learning methods. To this end, we 
introduced a formal description of what we call atomic 
environment and atomic configuration, which provides a 
firm mathematical ground for defining a distance between 
conhgurations. Then, we showed that an environment 
can be represented as a permutation invariant density of 
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Figure 13: Atomic Configuration Distance to each 
reference structure normalized by the distance to FCC, 
as a function of a 


probability, the Frac. This consideration naturally led 
us to define a permutation invariant distance between 
atomic environments of an arbitrary number of atoms ni 
and 712- In particular, the Gaussian case provides an ana¬ 
lytical distance whose computational cost scales as nin 2 - 
We showed that taking the minimum over rotations of 
this distance creates a permutation and rotation invari¬ 
ant Atomic Configuration Distance (Acd). This means 
that a distance can be defined between two configura¬ 
tions. The significant difference with previous method¬ 
ologies (notably Rmsd) is that the Acd is intrinsically 
permutation invariant, so that performing the minimum 
over permutations is unnecessary, and comparisons of en¬ 
vironments with hundreds of atoms become tractable. 
This is of paramount importance for condensed matter 
systems. 

Many applications can be envisioned. We considered 
two of them as an example. First, we studied the ca¬ 
pability of fingerprints to reproduce the configurational 
information of a structure. This is of critical importance 
in several fields, in particular Machine Learning methods 
for numerical potentials. We showed that the introduced 
distance does not answer the issue of the choice of the 
descriptors by itself, but it provides a quantitative assess¬ 
ment of the improvement in the description provided by 
an additional representing function. This allows typically 
to implement greedy techniques, where parameters of a 
new function are optimized in order to obtain the maxi¬ 
mal correlation r between the Acd and the descriptor’s 
distance. A second application is structural analysis. By 
comparing a local structure to a reference data set of 
structures at 0 Kelvin, we showed that the Acd m is 
able to recognize the structure of a material at a positive 
temperature. Let us however mention that our methodol¬ 
ogy is not a way to characterize a structure from scratch. 
If the studied structure is not in the database, it will not 
be recognized. 

Finally, we want to emphasize that we described the 
presented tools in a quite general and abstract way to 


allow for various applications. At several points of the 
derivation of the distances, choices can be made (such 
as the functions for representing the presence of an atom 
in the Frac or the norm to compare densities). More¬ 
over, other applications can be considered: comparing 
molecules, testing other fingerprints, or using directly 
the similarity S defined in (1131) as a Kernel, as for the 
SoApi^, (but taking the minimum over rotations rather 
than integrating over them and using the analytical ex¬ 
pression m)- Another important issue is the sparsifi- 
cation, i.e. the construction of a database minimizing 
the redundancy of information. Indeed, in the context 
of numerical potentials, it is crucial to have the smallest 
possible database to decrease the computation cost of the 
potential. 

Appendix A: Proof that ACD is a Distance 

We here prove that the application defined in (fT^ is 
a distance on C. We remind that to be a distance on C, 
d 2 must verify, for any configurations Ci, C 2 , Cg C, the 
following properties: 

• d2(Ci,C2) =d2(C2,Ci), 

• '^(C'l, ( 72 ) = 0 if and only if Ci = C 2 , 

• d 2 (Cl, C 2 ) < d 2 (Cl, C) + d 2 (C, C 2 ). 

First, d 2 being invariant by permutation, and since the 
rotation can be taken over Ci or C 2 in m, the defini¬ 
tion of the distance is easily seen to be independent of 
the choice of the representing environments Ci £ Ci and 
C 2 £ C 2 . Secondly, the optimization problem always has 
a solution since the space of rotations TZ is compact and 
the application i? ^2 (Ci,i?C 2 ) is continuous. There¬ 
fore, the application is well-defined on C x C. Now, the 
triangle inequality results from the isomorphism with the 
associated densities pi, p 2 - More precisely, for another 
configuration C G C, with a representing element C G C 
and associated density p, 

\\pi - P 2 \\l^ < \\pi - pWl^ + \\p - P2IU2, 

So, for any rotation R, 

d2{Ci,RC2) < d2{Ci,C) + d2{C,RC2). 

Moreover, given that C £ C is arbitrary, we can choose 
it such that d 2 (Ci,C) = inf_c? 2 (Ci,C'), which leads to 

C'eC 

d 2 (Ci, RC 2 ) < d 2 (Ci, C) + d 2 {C, RC 2 ). 

Now, by taking the infimum over rotations R, first on the 
left hand side, and then on the right side, we obtain the 
triangle inequality 

d 2 (Cl, C 2 ) < d 2 (Cl, C) + d 2 (C, C 2 ). 


II 









The last point is to prove that Ci = C 2 if and only if 
d 2 {Ci,C 2 ) = 0. First, it is clear that d 2 {C,C) = 0 for 
any C £ C. Now, take Ci, C 2 £ C and Ci £ Ci, C 2 £ C 2 
such that d 2 {Ci,C 2 ) = 0. This implies 

inf d 2 (Cl, i?C' 2 ) = 0. 


Therefore, there exists Rq £TZ such that ^2 (Ci, i?oC 2 ) = 
0, and given that ^2 is a distance up to a permutation, 
there exists ao £ S such that Ci = aoRoC 2 , i-e. Ci and 
C 2 belong to the same class and Ci = C 2 - 


Appendix B: Analytical Computation of the overlap matrices in the Gaussian case 

We prove formulas (fT^ and (fHl) . First, 


S{pi,P2) 


P 1 P 2 = 


nin2 


EE 

i=i i=i 


<( 9 ) <'( 9 ) dq. 


Any integral in the double sum can be analytically computed as 


<(9)¥^"'(9)dg = 


(2707 


2'|3 


exp - 


2a2 L 


(9-9.)' + (9-9'f 


d9 


L (25^ (“2^ - 2, -- 2, -+ ,»)) d, 

92 + 9;V (92 + 9;)^ 9?+ 9; 


Ir3 ( 271(7 

1 

( 271 ( 72)3 


2'|3 


exp -- 


9 - 


d9 


exp-y 9 - 


/R3 


9i + 9d 


d9 


exp ( (* - 'Ij] 


Finally, the identity 



^ dg = (ct^tt)^/^ 


gives the desired formula. 

Appendix C: Asymptotics of Acd 


We study the asymptotic behavior of formula USD 
when (7 —>■ 0 and cr —>■ +cx). We rewrite the overlap 
factor as 


5'(cr) 


{na 


2^-; 


8nin2 


ni n2 

EEe^P 

i=i j=i 


4a2 J ’ 


where S is considered as a function of a and the two 
environments Ci = (qi)^di, C 2 = {q'i)^^^ £ C are fixed. 
When a — +(X), it holds for all i, j, 


Given that distances are obtained by taking the square 
root of overlap factors, the AcD scales as when 

a —)■ + 00 . 

In the case ct —>■ 0, one has to distinghish between 
the cases qi ^ g', for which exp ^ converges 

to 0 very fast; and qi = q', in which case this factor is 
always equal to 1. Since in the definition of the Acd 
distance (fTUl) there are two overlap factors involving the 
same density, the dominant terms in are the elements 
in the double sums S{pi,pi) and S{p 2 ,P 2 ) for which i = 
j, and they are equal to 1. The asymptotic behavior is 
then determined by the diverging prefactor, which shows 
that the Acd scales as when ct —>■ 0. 


exp 


(92^\ 

4a2 J 


-^ 1 . 

cr—>-+oo 


Appendix D: Simulated Annealing 


As a result. 


S{a) 


(710-2) 




+00 8nin2 


We shortly describe here the simulated annealing 
method^^i^l used to solve the optimization problem 
in m- We use a Metropolis Random Walk with Gaus¬ 
sian proposals^, together with a schedule for tempera- 


12 






















ture decrease. We also decrease the variance of the pro¬ 
posals in order to maintain the acceptance rate roughly 
constant. We denote 6 = the set of angles 

parametrizing the rotation. We choose a starting point 
9o, ^0 > 0, Ao > 0, a decrease rate r. The iteration 
proceeds as follows for each fc > 1: 

• define 9k+i = 9k + ikGk, where Gk ^ Af{0,ld) 
is a standard 3-dimensional Gaussian vector with 
identity covariance, 

• compute 

a{9k+i,9k) = exp 

and r{9k+i) = min (l,a(0fc+i)), 

• accept the move with a probability r(9k+i) and set 
9k+i = 9k+i in this case; otherwise reject the move 
and set 9k+i = 9k, 

• set Afc+i = rAfe and ^fc+i = r^fe. 

We fix the number of iterations (typically 1000 steps), 
and run the algorithm from various starting points 9o on 
a uniform grid, typically 2 or 3 points in each direction. 
We use r = 0.99. The initial values and Aq are choosen 
such that the acceptance rate remains of order 60 — 70% 
at each stage. Typical values are ■Co = 1 and Aq = 10“^. 
The value of Aq is derived from the following computation 

J{9,) - J{9o) - ^oGo ■ VJ{9o), 

which suggests to choose Aq ^ol|V^(^o)ll- Another 
option could be to use a single starting point but perform 
sequences of heating-cooling, i.e. increasing-decreasing 
Afe and ^k to explore the different parts of the domain. 

Appendix E: Symmetry functions 

We fix the values of the parameters ry =0.3 and 
7?cut=8.52 A for the set of G^ symmetry functions used 
as a descriptor in the applications presented in Sect ion ITVl 
(correlation graph and structural analysis). The remain¬ 
ing Rs parameters take the values 1.5, 2.0, 2.2, 2.4, 2.6, 
2.8, 3.0, 3.5 A. These functions correspond to Gaussian 
functions centered at different radiuses to represent the 
radial information of a configuration. 

Appendix F: Generation of databases 

We give here precisions on the generation of the 
databases used for the numerical applications. First, we 
used in section IV-A a database given by MD simulations 
of a Lennard-Jones fluid. Secondly, a reference database 
of structure was generated for the structural analysis per¬ 
formed in section IV-B, and another MD simulation was 
realized to compute a configuration of CFG at 100 Kelvin. 


1. Fluid 

The configurations of fluid were generated through a 
MD simulation of a Lennard-Jones NVT system at 1000 
Kelvin. Langevin equations are used with a truncated 
and shifted Lennard-Jones potential Vlj. The values of 
the parameters are e = 1.6567944 x 10“^^J and a = 3.405 
A, reproducing thermodynamic properties of Argon. A 
cut-off radius i?cut = 2.5(T was employed. The timestep 
for integration is equal to 5 x 10”^® s, and the dynamics 
is integrated over 20000 time steps. 

Finally, the conhgurations are obtained by recentering 
the environment on each atom and using the periodic 
conditions to define its neighbors. 


2. Reference Database of Structures 

The environments describing crystals (GFG, BCG, etc) 
at 0 Kelvin are well known. The issue is to use structures 
with a correct density, since comparing structures at dif¬ 
ferent densities is irrelevant in our case. Indeed, identical 
structures at different densities will be considered as dif¬ 
ferent by the Acd. As a result, we use GFG as a reference 
structure, and its lattice constant is chosen such that the 
structure has a minimal energy, or equivalently a zero 
pressure. The equilibrium density is 1.8025 kg.m“^. The 
other structures are generated with the same density (lat¬ 
tice parameters are adjusted since the shape of the unit 
cell and the number of atoms per unit cell change). For 
a GFG structure at 100 Kelvin, the same MD simulation 
is run over 20 000 steps, but at 100 Kelvin. 

Note that in addition to the invariance by permuta¬ 
tion and rotation, we could here add an invariance by 
dilatation of an environment. As a result we could search 
for the dilatation that best allows to match two environ¬ 
ments. Nevertheless, this would add a degree of search 
(and therefore increase the computational cost) whereas 
the physical approach of keeping the density fixed is 
straightforward and inexpensive. 


REFERENCES 

^E. Karakoc, A. Cherkasov, and S. C. Sahinalp, Bioin¬ 
formatics 22, 243 (2006) 

^S. Goedecker, J. Chem. Phys. 120, 9911 (2004) 

^M. Amsler and S. Goedecker, J. Chem. Phys. 133, 
224104 (2010) 

^K. T. Schiitt, H. Glawe, F. Brockherde, A. Sanna, K. R. 
Muller, and E. K. U. Gross, Phys. Rev. B 89, 205118 
(2014) 

®A. Sadeghi, S. A. Ghasemi, B. Schaefer, S. Mohr, M. A. 
Till, and S. Goedecker, J. Chem. Phys. 139, 184118 
(2013) 

®A. R. Oganov and M. Valle, J. Chem. Phys. 130, 
104504 (2009) 


13 


^L.-F. Arsenault, A. Lopez-Bezanilla, O. A. von Lilien- 
feld, and A. J. Millis, Phys. Rev. B 90, 155136 (2014) 
®K. Hansen, G. Montavon, F. Biegler, S. Fazli, M. Rupp, 
M. Scheffler, O. A. von Lilienfeld, A. Tkatchenko, 
and K.-R. Muller, J. Chem. Theory Comput. 9, 3404 
(2013) 

®A. Seko, A. Takahashi, and 1. Tanaka, Phys. Rev. B 
90, 024101 (2014) 

J. Szlachta, A. P. Bartok, and G. Csanyi, Phys. 
Rev. B 90, 104108 (2014) 

Behler, J. Phys.: Gondens. Matter 26, 183001 
(2014) 

M. Handley and J. Behler, Eur. Phys. J. B 87 (2014) 
^^A. P. Bartok, M. G. Payne, R. Kondor, and G. Csanyi, 
Phys. Rev. Lett. 104, 136403 (2010) 

^^A. P. Bartok, R. Kondor, and G. Csanyi, Phys. Rev. B 
87, 184115 (2013) 

^®M. Rupp, A. Tkatchenko, K.-R. Muller, and O. A. von 
Lilienfeld, Phys. Rev. Lett. 108, 058301 (2012) 

Behler and M. Parrinello, Phys. Rev. Lett. 98, 
146401(2007) 

i7j. Behler, J. Chem. Phys. 134, 074106 (2011) 

^®P. Steinhardt, D. Nelson, and M. Ronchetti, Phys. Rev. 
B 28, 784 (1983) 

^®A. P. Bartok, Gaussian Approximation Potential: 
an interatomic potential derived from first principles 
Quantum Mechanics, Ph.D. thesis, Cambridge (2009) 


R. Flower, J. Chem. Inf. Comp. Sci. 38, 379 (1998) 
Kabsch, Acta Cryst A 32, 922 (1976) 

A. Coutsias, C. Seok, and K. A. Dill, J. Comput. 
Chem. 25, 1849 (2004) 

L. Theobald, Acta Cryst. A 61, 478 (2005) 

^^M. E. Rose, Elementary theory of angular momentum 
(Courier Corporation, 1995) 

A. Varshalovich, A. Moskalev, and V. Khersonskii, 
Quantum theory of angular momentum (World Scien¬ 
tific, 1988) 

^®P. J. Van Laarhoven and E. H. Aarts, Simulated An¬ 
nealing: Theory and Applications, Mathematics and its 
Applications, Vol. 37 (Springer Science & Business Me¬ 
dia, 1987) 

C. Spall, Introduction to Stochastic Search and Op¬ 
timization: Estimation, Simulation, and Control, Se¬ 
ries in Discrete Mathematics and Optimization, Vol. 65 
(John Wiley & Sons, 2005) 

^®B. W. Silverman, Density Estimation for Statistics and 
Data Analysis, Vol. 26 (CRC press, 1986) 

^®W. Hardle, Nonparametric and Semiparametric Models 
(Springer Science & Business Media, 2004) 

Metropolis, A. W. Rosenbluth, M. N. Rosenbluth, 
A. H. Teller, and E. Teller, J. Chem. Phys. 21, 1087 
(1953) 


14 


